Cepstral analysis of vocal dysperiodicities in disordered connected speech
نویسندگان
چکیده
Several studies have shown that the amplitude of the first rahmonic peak (R1) in the cepstrum is an indicator of hoarse voice quality. The cepstrum is obtained by taking the inverse Fourier Transform of the log-magnitude spectrum. In the present study, a number of spectral analysis processing steps are implemented, including period-synchronous and periodasynchronous analysis, as well as harmonic-synchronous and harmonic-asynchronous spectral band-limitation prior to computing the cepstrum. The analysis is applied to connected speech signals. The correlation between amplitude R1 and perceptual ratings is examined for a corpus comprising 28 normophonic and 223 dysphonic speakers. One observes that the correlation between R1 and perceptual ratings increases when the spectrum is band-limited prior to computing the cepstrum. In addition, comparisons are made with a popular cepstral cue which is the cepstral peak prominence (CPP).
منابع مشابه
Multi-band and multi-cue analyses of disordered connected speech
The objective is to analyze vocal dysperiodicities in connected speech produced by dysphonic speakers. The analysis involves a speech variogram-based method that enables tracking instantaneous vocal dysperiodicities. The dysperiodicity trace is summarized by means of the signal-todysperiodicity ratio, which has been shown to correlate strongly with the perceived degree of hoarseness of the spea...
متن کاملMulti-band dysperiodicity analyses of disordered connected speech
The objective is to analyze vocal dysperiodicities in connected speech produced by dysphonic speakers. The analysis involves a variogram-based method that enables tracking instantaneous vocal dysperiodicities. The dysperiodicity trace is summarized by means of the signal-to-dysperiodicity ratio, which has been shown to correlate strongly with the perceived degree of hoarseness of the speaker. P...
متن کاملAssessment of vocal dysperiodicities in connected disordered speech
The aim of the presentation is to investigate acoustic analysis of connected speech by means of an average-equalized and energy-equalized variogram to extract vocal dysperiodicities. The variogram enables positioning a current and a lagged analysis frame in adjacent speech cycles to track inter-cycle dysperiodicities. Average and energy equalization of the analysis frames are options that make ...
متن کاملGeneralized variogram analysis of vocal dysperiodicities in connected speech
A generalized variogram is used to track vocal dysperiodicities in connected speech, which are summarized by means of a signal-to-dysperiodicity marker. To evaluate the variogram-based analysis, signal-to-dysperiodicity estimates are correlated with scores obtained by means of perceptual ratings of the degree of hoarseness, which are based on comparative judgments of pairs of speech samples. Th...
متن کاملMulti-band Segmental Signal-to-dysperiodicity Ratios in Connected Speech Produced by Normophonic and Dysphonic Speakers
The objective is to analyze vocal dysperiodicities in connected speech produced by dysphonic speakers. The analysis involves a variogram-based method that enables tracking instantaneous vocal dysperiodicities. The dysperiodicity trace is summarized by means of the signal-to-dysperiodicity ratio, which has been shown to correlate strongly with the perceived degree of hoarseness of the speaker. P...
متن کامل